Skip to content

Add nvidia provider with 15 models#48

Open
TF0rd wants to merge 1 commit into
mnfst:mainfrom
TF0rd:add-nvidia-models
Open

Add nvidia provider with 15 models#48
TF0rd wants to merge 1 commit into
mnfst:mainfrom
TF0rd:add-nvidia-models

Conversation

@TF0rd

@TF0rd TF0rd commented Jun 7, 2026

Copy link
Copy Markdown

Type of change

  • Add a model
  • Add or fix parameters

Summary

Adds the nvidia provider with 15 model entries sourced from the official NVIDIA NIM API reference at docs.api.nvidia.com/nim/reference/llm-apis#nvidia.

Each model's parameters were scraped from its -infer endpoint documentation (e.g. nemotron-3-ultra-550b-a55b-infer).

Models added

Model Notable params
nemotron-3-ultra-550b-a55b reasoning_effort (none/medium/high), reasoning_budget
nemotron-3-super-120b-a12b reasoning_effort (none/low/high), reasoning_budget
nemotron-3-nano-30b-a3b temp, top_p, max_tokens, seed, stop
nemotron-mini-4b-instruct tools, freq/presence penalties
nemotron-content-safety-reasoning-4b temp, top_p, max_tokens, seed, stop
llama-3.1-nemotron-nano-8b-v1 full OpenAI-compatible params + seed
llama-3.1-nemotron-ultra-253b-v1 full OpenAI-compatible params + seed
llama-3.3-nemotron-super-49b-v1 full OpenAI-compatible params + seed
llama-3.3-nemotron-super-49b-v1.5 max_tokens up to 65536
llama-3.1-nemotron-safety-guard-8b-v3 temperature only (0-1, default 0)
llama-3.1-nemoguard-8b-topic-control temp 0-2, top_p, freq/presence penalties
riva-translate-4b-instruct-v1.1 translation model, default temp 0
usdcode-llama-3.1-70b-instruct expert_type enum (auto/code/knowledge/helperfunction)
gliner-pii labels, threshold, chunk_length, overlap, flat_ner
nemoguard-jailbreak-detect input param (security endpoint)

Models omitted

Two nvidia/ models were not added because their infer pages document only stream (a reserved MPS path that cannot be a parameter):

  • llama-3.1-nemoguard-8b-content-safety — only stream, accept, model, messages
  • nvidia-nemotron-nano-9b-v2 — infer page appears buggy (serves content-safety model data)

Checks

  • npm run validate — OK (189 models)
  • npm test — 101/101 passed
  • npm run guard:params — no removals

Docs

Add model parameter catalog entries for all NVIDIA NIM API models
(provider: nvidia, authType: api_key) with parameters sourced from
the official NVIDIA API reference at docs.api.nvidia.com.

Models added:
- nemotron-3-ultra-550b-a55b (reasoning_effort, reasoning_budget)
- nemotron-3-super-120b-a12b (reasoning_effort, reasoning_budget)
- nemotron-3-nano-30b-a3b
- nemotron-mini-4b-instruct (tools support)
- nemotron-content-safety-reasoning-4b
- llama-3.1-nemotron-nano-8b-v1
- llama-3.1-nemotron-ultra-253b-v1
- llama-3.3-nemotron-super-49b-v1
- llama-3.3-nemotron-super-49b-v1.5 (65536 max_tokens)
- llama-3.1-nemotron-safety-guard-8b-v3
- llama-3.1-nemoguard-8b-topic-control
- riva-translate-4b-instruct-v1.1
- usdcode-llama-3.1-70b-instruct (expert_type enum)
- gliner-pii (entity extraction: labels, threshold, chunk_length,
  overlap, flat_ner)
- nemoguard-jailbreak-detect

Two models omitted — their infer pages document only stream
(a reserved MPS path not eligible as a parameter):
- llama-3.1-nemoguard-8b-content-safety
- nvidia-nemotron-nano-9b-v2 (infer page appears buggy; serves
  content-safety model data)
@vercel

vercel Bot commented Jun 7, 2026

Copy link
Copy Markdown

@TF0rd is attempting to deploy a commit to the Manifest Team on Vercel.

A member of the Team first needs to authorize it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

model Add a model that's missing

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant